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Abstract: Microsatellites have been widely used in studies on population genetics, ecology and evolutionary biology. 
However, microsatellites are not always available for the species to be studied and their isolation could be 
time-consuming. In order to save time and effort researchers often rely on cross-species amplification. We revealed a new 
problem of microsatellite cross-species amplification in addition to size homoplasy by analyzing the sequences of 
electromorphs from seven catfish species belonging to three different families (Clariidae, Heteropneustidae and 
Pimelodidae). A total of 50 different electromorphs were amplified from the seven catfish species by using primers for 4 
microsatellite loci isolated from the species Clarias batrachus. Two hundred and forty PCR-products representing all 50 
electromorphs were sequenced and analyzed. Primers for two loci amplified specific products from orthologous loci in all 
species tested, whereas primers for the other two loci produced specific and polymorphic bands from some 
non-orthologous loci, even in closely related non-source species. Size homoplasy within the source species was not 
obvious, whereas extensive size homoplasy across species were detected at three loci, but not at the fourth one. These data 
suggest that amplification of products from non-orthologous loci and appearance of size homoplasy by cross-amplification 
are locus dependent, and do not reflect phylogenetic relationship. Amplification of non-orthologous loci and appearance 
of size homoplasy will lead to obvious complications in phylogenetic interference, population genetic and evolutionary 
studies. Therefore, we propose that sequence analysis of cross-amplification products should be conducted prior to 
application of cross-species amplification of microsatellites. 
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Microsatellites are short tandem repeat DNA 
sequences with the unit length of 1 to 6 base pairs 
(Weber & May, 1989). Because they are highly 
polymorphic, co-dominant in nature, easy to score by 
PCR and rather abundant in most organisms studied, they 
have been widely used for the study of linkage mapping, 
comparative mapping, demographic structure and 
phylogenetic history in populations (Goldstein & 
Schlotterer, 1999; Zhang et al, 2001). However, 
microsatellites are not always available for the species to 
be studied and their isolation could be time-consuming 
(Lin et al, 2008; Wang et al, 2008). In order to save time 
and effort researchers often rely on cross-species 
amplification (Chang et al, 2008; Küpper et al, 2008; 
Kayser et al, 1996; Kijas et al, 1995; Lin et al, 2008). 
This procedure uses PCR primers complementary to the 
flanking regions of loci from a extensively studied 
(source) species to amplify microsatellites from closely 
(Harr et al, 1998) or sometimes quite distantly related 
species (Gonzalez-Martinez et al, 2004) for which no 
such markers are described. One problem related to 
cross-species amplification is size homoplasy 
(Anmarkrud et al, 2008; Estoup et al, 1995). PCR 
products of microsatellite loci with the same fragment 
length, but different sequence can arise from mutational 
events (deletion or insertion) in the flanking regions of 
the repeats or by interruptions in a perfect repeat 
producing alleles of the same size, which however are 
not identical by decent. Microsatellite size homoplasy 
has been reported in a number of papers (Hempel & 
Peakall, 2003; Makova et al, 2000; van Oppen et al, 2000) 
and was thought be a major problem of cross-amplific- 
ation. It seems that size homoplasy increases with time 
divergence among populations and taxa (Estoup et al, 
1995). However, a current study showed that homoplasy 
at microsatellite electromorphs did not represent a 
significant problem for many types of population 
genetics analyses performed by molecular ecologists, as 
the extensive variability at microsatellite loci often 
compensated for their homoplasious evolution (Estoup et 
al, 2002). 

In this paper, we describe a new problem of 
applying microsatellites for several different taxa. 
Cross-species amplification of microsatellites generated 
polymorphic products from non-orthologous loci, which 
were revealed by sequence analysis of 240 clones 
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representing all 50 electromorphs from four loci in seven 
species (Clarias batrachus, C. fuscus, C. gariepinus, C. 
Heterobranchus longfilis, 
pneustes fossilis and Phractocephalus hemioliopterus). 


1 Materials and Methods 


macrocephalus, Hetero- 


1.1 Species and phylogenetic analyses 

Seven species of catfish were used in this study, 
namely: Clarias batrachus (abbreviation: Cba, the 
source species), C. fuscus (Cfu), C. gariepinus (Cga), C. 
macrocephalus (Cma), Heterobranchus longfilis (Hlo), 
Heteropneustes fossilis (Hfo), and Phractocephalus 
(Phe). According to the 
taxonomical system, five of the species studied were 


hemioliopterus current 
from the Clariidae family, one (Heteropneustes fossilis) 
from the Heteropneustidae family, which is closely 
related to Clariidae and the last (P. hemioliopterus) from 
the more distant Pimelodidae family. In order to 
determine the exact evolutionary relationship among the 
seven catfish species, phylogenetic analyses were 
conducted on the basis of the partial sequences of cytb 
genes from their mitochondrial genome. The sequences 
of six species C. batrachus [AF235932], C. fuscus 
[AF416885], C.  gariepinus [AF126823]), C. 
macrocephalus [AJ548464], Heterobranchus longfilis 
[AY995125], and Heteropneustes fossilis [AF126828] 
were downloaded from Genbank, whereas the one of P. 
hemioliopterus was amplified with PCR and sequenced 
as described (Agnese & Teugels, 2005). The sequence of 
the cytb gene of the Asian arowana (Scleropages 
formosus; DQ023143) was used as an outgroup. All 
seven sequences were aligned using Clustal X 
(Thompson et al, 1997), and a NJ tree was reconstructed 
using the Kimura-2 parameter model of nucleotide using 
MEGA 3.0 (Kumar et al, 2001). The partial sequence of 
the cytb gene of P hemioliopterus was deposited in 
GenBank under the accession number DQ200272. 
1.2 Sequencing of electromorphs generated by 
cross-species amplification 

All 50 electromorphs (Tabs. 1—4) generated in an 
earlier study (Yue et al, 2003) from four microsatellites 
(Cba01, Cba03, Cba06 and Cba20) from each of the 
seven species were used for cloning and sequencing. 
PCR products (25 uL) were cleaned using a 
glassmilk-based optimized procedure described earlier 
(Yue et al, 2007; Yue & Orban, 2001) prior to ligation of 


No. 2 


the fragments in to the pGEM-T-Easy vector (Promega) 
and subsequent transformation into XL-10 gold 
ultracompetent cells (Stratagene). Colonies were 
subjected to white/blue selection, and the insert of 
selected white clones was amplified by colony PCR as 
described (Yue et al, 2000). Un-incorporated PCR 
primers were removed by treating 5 uL PCR product for 
each clone with 0.5 unit shrimp alkalic phosphatase 
(SAP; USB) and 0.2 unit Exonuclease I (Exol; USB) in 
1x SAP buffer at 37°C for 30 min, followed by a 
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treatment at 80°C for 15 min to inactivate the enzymes. 
One uL treated PCR product was directly used as 
template for sequencing from both directions using a 
BigDye kit (Applied Biosystems) and either M13 
forward or M13 reverse primer in a PTC-100 PCR 
machine (MJ Research). Electrophoretic separation of 
the sequencing products was performed by using an 
ABI3730xl sequencer (Applied Biosystems). In order to 
exclude the possibility of cloning artifacts, for each 
electromorph from each species, multiple clones (at least 


Tab. 1 Electromorphs amplified by the primer pair designed for Cba01 in seven catfish species 








Electromorph (bp) Species (occurence) GenBank No. (species) 
199 Hfo (12)" AY 196549 (Hfo) 
241 Cba (1), Cma (1) AY 238446 (Cba), AY 196536 (Cma) 
243 Cba (2) AY 196532 (Cba) 
245 Cba (2) AY 196533 (Cba) 
247 Cba (3), Cma (1) AY 196518 (Cba), AY 196520 (Cma) 
249 Cba (2), Cma (2 AY 196534 (Cba), AY 196537 (Cma) 
251 Cma (2), Hlo (7) AY 196538 (Cma), AY 196554 (Hlo) 
253 Cba (2), Cma (1) AY 196535 (Cba), AY 196521 (Cma) 
255 Cfu (4) AY 196542 (Cfu) 
259 Hlo (3) AY 196530 (Hlo) 
261 Hlo (1), Phe (6) AY 196531 (Hlo), AY 196523 (Phe) 
263 Cma (1) AY 196539 (Cma) 

Cfu (10), Cma (1), AY 196543 (Cfu), AY 196540 (Cma), 
mA Hlo (1) AY 196555 (Hlo) 
267 Cma (2), Phe (6) AY 196522 (Cma), AY196547 (Phe) 
269 Cfu (2) AY 196544 (Cfu) 
271 Cfu (7), Cma (1) AY 196519 (Cfu), AY 196541 (Cma) 
277 Cfu (1) AY 196545 (Cfu) 
311 Hfo (1) AY 196550 (Hfo) 
315 Cga (3) AY 196548 (Cga) 
341 Cga (3) AY 196524 (Cga) 
345 Cga (6) AY 196525 (Cga) 
347 Hfo (4), AY 196526 (Hfo) 
349 Hfo (7) AY 196527 (Hfo) 


“This locus appeared to be duplicated in Heteropneustes fossilis, as all individuals contained more 


than two loci. The 199 bp electromorph contained a 150 bp deletion in comparison to the largest 


allele from the same species (Hfo349). 


Tab. 2 Electromorphs amplified by the primer pair designed for Cha03 in seven catfish species 





Electromorph (bp) Species (occurence) GenBank No. (species) 
Cha (12), Cfu (24), AY 196556 (Cba), AY 196557 (Cfu), 
on Cga (12), Hlo (12), AY 196558 (Cga), AY 196563 (Hlo), 
Phe (12),), AY 196564 (Phe) 
Hfo (4) AY 1965561 (Hfo) 
132 Cima (8), Hfo (8) AY 196559 (Cma), AY 196562 (Hfo) 


135 Cma (4) 


AY196560 (Cma) 
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Tab. 3 Electromorphs amplified by the primer pair designed for Cbha06 in seven catfish species 





Electromorph (bp) Species (occurence) GenBank No. (species) 

168 Cga (6), Hlo (12), AY 196572 (Cga), AY 196574 (Hlo), 
172 Cga (6) AY 196573 (Cga) 

174 Phe (12) AY 196578 (Phe) 

199 Hfo (12) AY 196579 (Hfo) 

211 Cha (6) AY 196567 (Cba) 

214 Cha (6) AY 196568 (Cba) 

242 Cima (2) AY 196569 (Cma) 

245 Cfu (24), Cma (5) AY 196571 (Cfu), AY 196570 (Cma) 
248 Cima (3) AY 196580 (Cma) 

255 Cima (1) AY 196581 (Cma) 

258 Cima (1) AY 196582 (Cma) 


Tab. 4 Electromorphs amplified by the primer pair designed for Cba20 in seven catfish species 








Electromorph (bp) Species (occurence) GenBank No. (species) 
93 Cha (1) AY 196596 (Cba) 
95 Cha (3) AY 196584 (Cba) 
99 Cha (1) AY 196597 (Cba) 
109 Cfu (1) AY 196586 (Cfu) 
111 Cfu (1), Hlo (2) AY196585 (Cfu), AY 196593 (Hlo) 
113 Cba (3), Cma (2) AY196598 (Cba), AY 196603 (Cma) 
117 Cga (2) AY196587 (Cga) 
119 Cba (3), Cga (10), Hfo (2),) AY196599 (Cba), AY 196588 (Cga), AY196591 (Hfo) 
121 Cba (1) AY196583 (Cba) 
i Cfu (22), Cma (3), AY196600 (Cfu), AY 196589 (Cma), 
Hlo (2), Hfo (1) AY 196594 (Hlo), AY 196605 (Hfo) 
125 Cma (5), Hlo (2), Hfo (9) AY 196590 (Cma), AY196601 (Hlo), AY 196592 (Hfo) 
129 Cma (2) AY 196604(Cma) 
143 Hlo (6) AY 196602 (Hlo) 
3) were sequenced. Altogether the following number of 2 Results 


clones were sequenced for the four microsatellite types: 
Cba0l - 107 clones, Cbha03 - 20 clones, Cba06 - 50 
clones and Cbha20 - 63 clones. Alignment of sequences 
was carried out by using Clustal X (Thompson et al, 
1997). 


99 












0.02 


Fig. 1 


2.1 Phylogenetic relationship of the seven catfish 
species 

Based on the partial sequences of the cytb gene of 
the seven species, a NJ tree was constructed (Fig. 1). The 


Clarias gariepinus 

Heterobranchus longifilis 
Clarias batrachus 

C. fuscus 

C. macrocephalus 

Heteropneustes fossilis 

Scleropages formosus 


Phractocephalus hemioliopterus 


Phylogenetic relationship among the seven catfish species 


Scale bar (substitution/sites) is shown under the tree, whereas the bootstrap values (>50%) after 1000 replicates are shown on the branches. 
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three species Clarias batrachus, C. fuscus and C. 
macrocephalus were closely related and clustered into a 
group. This group was linked to the group of C. 
gariepinus and Heterobranchus longifilis. The remaining 
two species: Heteropneustes fossilis and 
Phractocephalus hemioliopterus were distantly related to 
other five species. 
2.2 Sequence analysis of electromorphs amplified by 
the Cba01 primer pair 

The primer pair designed to the Cha0/ locus 
amplified polymorphic products in all seven catfish 
species tested. Altogether 23 clear bands (eletromorphs) 
were detected in the seven species (size range: 199-349 
bp), their sequencing analyses uncovered the total of 34 
different alleles (Tab. 1). In C. fuscus, C. macrocephalus 
and P. hemioliopterus both the repeat and the flanking 
regions exhibited high similarity to source sequences 
from C. batrachus (Fig. 2A). On the other hand, the 
corresponding sequences from C. gariepinus, and 
Heteropneustes fossilis species were completely different 
from the source sequences (Fig. 2B), but quite similar 
among these three species. The length of Heterobranchus 
longifilis alleles was similar to those of the source 
species, but the flanking region and repeats were entirely 


different (Fig. 2C). 

The 5' and 3' flanking sequences for each allele were 
nearly identical in different individuals of C. batrachus, 
C. fuscus, C. macrocephalus and P. hemioliopterus, 
respectively. On the other hand, several differences were 
found between sequences from different species both at 
the 5' and 3' flanking regions (seven and eight positions, 
respectively). Most of them seem to have been caused by 
substitution, whereas the rest by insertion or deletion of a 
single base pair. A notable feature is, that the repeat 
structures of this locus were slightly different in these 
four species: (GC),(AC), in the source species, 
(GC)3GT(GC)s-6(AC)s(GC)o-1(AC)n in C. fuscus and P. 
hemioliopterus, whereas (GC),2-5(AC)o-1 
(GC)o-4(AC)o-2GC(AC), in C. macrocephalus (Fig. 2A). 
Therefore, the polymorphism at this locus was caused by 
change in the number of either AC or GC repeat units in 
different species, resulting in fragments of the same 
length, but with quite different sequences. Within species, 
detected in C. 
macrocephalus, but not in the source species, C. fuscus 


size homoplasy could only be 
or P. hemioliopterus. 

In C. gariepinus and H. fossilis, the sequences of 
the 7 electromorphs (Tab. 1) were different from those in 





A 

Cha247 CTATGCGC---------------- 

Cfu271 CTATGCECGCETGCGCGCGCGC--ACACACACACGCACACACACACACACACACACACACACACAABGAC 

Cma247 CTATGCGCACGCGCEC 

Cma253 CTATGCGCACGCGCGC 

Cma267 CTATGCGCGCGCGCACGCACACGCACACACACACACACACACACACACACACACACAC------ AAAGAC 

Phe261 CTATGCGCGCGTGCGCGCGCGC--ACACACACACGCACACACACACACACACAC---------- AAAGAC 
Kkkeeeaeee kkkkkaekeee khkhekekeeeee kaekkeee 

B 

€ga341 GGTTCAGTGG-------- AABBATGTAAGCGATGAATTTABACAGCGGGGAAAGAAGAAG 

Cga345 GGTTCAGTGG-------- AABBATGTAAGCGATGAATTTAAACAGCGGGGAAAGAAGAAG 


Hfo347? GGGTICTGTAA--AAAAAAAAAAATGTAAGC GATGAATTTAAACAGC GGGGAAAGAAGAAG 
Hfo349 GGGTCTGTAAAAAAAAASAAAAATGTAAGC GATGAATTTAAACAGC GGGGAAAGAAGAAG 


++ ++ ++ 
Cga341 AAGAGAAGAAGAGG 
Cga345 AAGAGAAGAAGAGG 





FREER EEE KEE RHEE EEE EEE EEE EEE EEE HEEEEEEEEE 


BAGACCGGGAAA-—--GAAGAGAAGTGG 


AGAGGAGAGAAGACCGGGAAA- —-- GAAGAGAAGTGG 


Hfo347 GAGAGAGGAGAAGGAGAGAGGAGGAGGAAAGGCGAGGAGGGGAAAC GAGGAGAAAAGGAT 
Hf0349 = GAGAGAGGAGAAGGAGAAAGGAGGAGGAGAGGCGAGGAGGGGAAAC GAGGAGAAAAGGAT 


teat ++ ++ ++ 


++ t+++++ * kee KEK 


Cga341 AGACGAGAGGAGGGGGA---TTTTTTTTTCTGAT 
Cga345 AGACGAGAGGAGGGGGA---TTTTTITTTTCTIGAT 
Hfo3A7 AGAAGAGAAGAGGAGGGGGA---TITTICCTGAT 
HfO349 AGAAGAGGAGAGGAGGGGGA---TTITTICCTGAT 


Kee KEE Ketek ++ 


c 
Hlo0o259 
Hlo261 


HERRERA EE 


tetke +++++ 


GTGTATAAATGTG----TGTGTGTGTGTGTTACCTGGTGTGTGCTGTGTGCTGACACA 
GTGTATAAATGTGGTGTTGTGTGTGTGTGTTACCTGOGTGT--GCTGTGTGCTGACACA 


KERR AERA HEHEHE SERRE EEE 


FREER EEE ESE 


Fig. 2 Sequence alignment of some electromorphs (amplified by the primer pair designed for Cha0/) 


from seven different catfish species 


Only the repeat sequences and the ends of the flanking regions adjacent to the repeats from a selected subset of alleles are shown. 


Numbers behind the species abbreviations indicate the allele length. Bold letters highlight the interruptions of the repeats. The flanking 
sequences are indicated in grey. GenBank identifiers from top to bottom: [AY 196518—-AY 196529] and [AY 196530—AY 196531]. A: 
Sequences from Clarias fuscus (Cfu), C. macrocephalus (Cma) and Phractocephalus hemioliopterus (Phe) exhibiting high level of 


similarity to the source species (Cba); B: Sequences from C. gariepinus (Cga), and Heteropneustes fossilis (Hfo) differing from 


sequences in the source species both in the repeat and flanking regions; C: Partial sequences from Heterobranchus longifilis (Hlo). 
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source species. The flanking sequences were quite 
similar among different alleles, although the polyA and 
polyT repeats (located at the 5' and 3' flanking regions, 
respectively) showed polymorphism both within and 
among species (Fig. 2B). Moreover, a deletion of 16 bp 
was detected in the 5' flanking region of Heteropneustes 
fossilis (data not shown). In C. gariepinus (but not in H. 
fossilis) a CAG unit was deleted from the 3' flanking 
region. A few point mutations, short deletions or 
insertions have also been detected in the 5' and 3' 
flanking regions among electromorphs from different 
species (data not shown). Polymorphism in the repeat at 
this locus was caused either by a change in the length of 
polyA stretch in the 5' flanking region, or by the unit 
number of (GA),, (GAA),, (GGA), compound repeats 
or by a deletion of three base pairs CAG and a change in 
the length of the polyT in the 3' flanking region (Fig. 
2B). 

In H. fossilis, the locus appeared to be duplicated, 
because more than two bands were detected in the PCR 
product of each individual tested, whereas no such 
phenomenon was observed in the other two species. The 
199 bp allele from all six individuals of H. fossilis tested 
(Genebank No. AY196549) lacked a 150 bp fragment 
including the 5' flanking region and even the whole 
repeat region as compared with the largest allele (Hfo349) 
(Fig. 2B). 

In Heterobranchus longifilis, the sequences of 
electromorphs were entirely differently from the alleles 
of the source species, although the length of the 
electromorphs was similar to those of the source species 
(Fig. 2C). The 
electromorphs was caused by the change of number of 


length polymorphism of the 


CT repeats. 
2.3 Sequence analysis of electromorphs amplified by 
the Cbha03 primer pair 
At the Cba03 locus, a total of three electromorphs 
(range: 129 — 135 bp) were detected across the seven 


species (Tab. 2). Sequencing of each electromorph (20 
clones) revealed that the sequence of this locus was 
highly conserved across the catfish species studied (Fig. 
3). The polymorphism was caused exclusively by the 
change in the unit number of the (GGA), repeat. At three 
positions of 3' flanking region, single base pair 
substitution was also seen in two species (C. 
macrocephalus and P. hemiolopterus). No size 
homoplasy was identified among individuals of any 


species. 
2.4 Sequence analysis of electromorphs amplified by 
the Cba06 primer pair 


At the Cba06 locus, a total of 11 electromorphs 
(range 168 —258 bp) were identified across the seven 
species (Tab. 3). Their sequence analysis demonstrated 
that they could be divided into two groups and two 
individual sequences (Fig.4A — D). Fragments amplified 
from C. fuscus (1 allele) and C. macrocephalus (4 alleles) 
showed an overall high similarity to the source sequence 
(Fig. 4A). In these two species, an insertion of a 34 bp 
fragment was detected at the 5' flanking region between 
the primer and repeats in every allele in comparison to 
the source sequence. Additional single base pair 
substitutions, located in the flanking regions were also 
found. The length polymorphism was caused by the 
change in the unit number of the (AAC), repeat within 
each species, but among species the length 
polymorphism could also be caused by change in the 
extent of polyA in the 3' flanking region or the insertion 
of a 34 bp fragment into the 5' flanking region. Although 
no size homoplasy was identified within these two 
species, its presence was quite obvious among species. 
For example, the 245 bp electromorph in C. fuscus and 
that in C. macrocephalus showed different unit number 
of CAA-repeats and appearance of a CTA sequence due 
to an A>T mutation in the latter. 

The second group (Fig. 4B) included sequences 


from C. gariepinus (2 alleles), and H. longifilis (1). The 


Cbhal29 BAAGGGAGGAGGAGGAGGA------ AGGGAAAACATGAAGCAGCAGTTTAACTGTA 
cful29 BAGGGAGGAGGAGGAGGA------ AGGGAAAACATGAAGCAGCAGTTTAACTGTA 
Cgal29 RAGGGAGGAGGAGGAGGA------ AGGGAAAACATGAAGCAGCAGTTTAACTGTA 
Cmal32 AAGGGAGGAGGAGGAGGAGGA---AGAGAAAACATGAAGCAGCAGTTTGACTGTA 
€mal35 AAGGGAGGAGGAGGAGGAGGAGGAAGAGAAAACATGAAGCAGCAGTTTGACTGTA 
Hfol29 AAGGGAGGAGGAGGAGGA------ AGGGAAAACATGAAGCAGCAGTTTAACTGTA 
Hfol32 RAAGGGAGGAGGAGGAGGAGGA---AGGGAAAACATGAAGCAGCAGTTTAACTGTA 
Hiol29 AAGGGAGGAGGAGGAGGA------ AGGGAARAACATGAAGCAGCAGTTTAACTGTA 
Phel29 AAGGGAGGAGGAGGAGGA------ AGAGAAAACATGAAGCAGCAGTTTAACTGTA 


Fig. 3 Sequence alignment of some electromorphs (amplified by the primer pair designed for Cha03) 


KEKEKKKKKKEKEEESE KE KBKKEKEKEKKEKEKEKKEEKEEEKEKEKEKEKKEKEEEESE 


from seven different catfish species shows no size homoplasy within or among species 
See Fig. 1 for labeling and other details. GenBank identifiers from top to bottom: [AY 196556—AY 196564]. 
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DNA sequence of the fragments from the two species 
showed high similarity to each other, but differed from 
the source sequence both in their flanking regions and 
repeat motif [(AAC), vs. (CA),]. An insertion of five 
base pairs (CGAAC) was seen in the 5' flanking region 
of the species H. longifilis, as compared the sequences 
from the C. gariepinus (Fig. 4B). Apart from this 
insertion, the length polymorphism was caused by the 
different number of the (AC), repeat units in all 
fragments. Between the two species, single base pair 
substitution was observed at several positions of the 
flanking regions. The 168 bp fragment appeared in both 
species. However comparison of sequences between the 
two species revealed two different alleles. 

The remaining two sequences (Fig. 4C-—D; 
GenBank Nos. AY196578 and AY196579) originated 
from P. hemioliopterus and Heteropneustes fossilis, 
respectively. They did not show any similarity to the first 
two groups except the primer binding sites and did not 
contain repeats. 

2.5 Sequence analysis of electromorphs amplified by 
the Cba20 primer pair 

A total of 13 electromorphs (range: 93 — 143 bp) 
were detected across six species (Tab. 4), but not in P. 


B: 


hemioliopterus. Sequence analysis revealed 10 additional 
alleles (Fig. 5), without any evidence of homoplasy 
within the source species. In the 5' flanking region, 
single base pair substitutions were detected at three 
positions among species. As compared with the source (4 
alleles), sequences from C. macrocephalus (4) and 
Heterobranchus longifilis (3) showed an insertion of 
three basepairs (GTC) in the 3' flanking region. Single 
basepair substitutions were also detected at two positions 
of the 3' flanking regions. The repeat region was highly 
variable within and among species. In the source species, 
repeat structure for the 95 bp allele was (TC).6GC(TC)», 
although longer and shorter alleles showed change in 
repeat number of longer repeat, the GC(TC). motif 
remained constant among all alleles. In C. fuscus (3 
alleles), where the (TC), repeat was interrupted by a TA 
unit, the (TC); upstream from the TA remained 
unchanged, whereas the downstream (TC), repeat 
showed polymorphism among individuals. In C. 
gariepinus (2 alleles) the TC repeats were interrupted by 
GC and TG units at several positions and the 
polymorphism was caused by the change of the long, 
upstream TC repeat, whereas the shorter ones remained 
constant. In C. macrocephalus and Heteropneustes 


Cdeill 
Cbaild 
Cutts 
Catt 


TATU -- ------------------------- --- -- ACU TATIGTIOOTGCEA 
TCATCT-- -- -------------------- --- -- --- -- ACC TATTGTTCCTGGAS 
TCATC TITTTATCTA CAT GTTGATCACT GCACCTGCTTGTACCTATIGTICCTGCAS 
TCATC TITTTATCTA CAT GTTGATCACT GCACCTGCTTGTACCTSTITGTICCTGCAS 


CrazidS TCATTTGTTTATC TACATCTTGATCACTOGACC TGCTIGTACC TATTGTTCOTGCAS 


RRKRK NX 


RARRARRRAR AAR ARAN 


Cbaitll TAa&C------CSTTTACASCAACASC AAC AS CAACAS-- -- --SCOTCT 
Cbeil4 TA&C------CATTTACASCAACAAC ASC AS CAR CSACSS- --ACTCT 
CfuitS TAC-34bp-CATTTSA CAS CASCAAL AAC AA CSS -- --- -AdRACTCT 
Ctt? TAC-3 dbp-CATTTACASCASCAAC AAC AS --- ----- -ASRACTCT 
Catts TAC-3 dbp-AdTTTS CAS CASCAAC BAC BACTH-- --- -SRRRCTCT 


RAR RARRARARARARRRARRARARRARNR 
B: 
Canlbs 
Canl?i 
Mlolbs -t 
Mlo163-1 

ETETEN 

C: 


RARRR 


&CGAGEE----- SR E0TAC GRACE CACTI TCAGGCCASCACSCACAC --- COCAGCTCACGTTAAT 
&CGAGRC-- -- SBA CTACCAACACACTITOACCCAACACACACACACSC CC CAGTCACGTTSAT 
ATCA 63 CCCSA CARACTAT CAAGACACTTACAGACAACSCACAL -- --- CCCAGCTCACGTTSAT 


&7G8, 08 CCCS3 CARACTAT GASGACSCTTAGAGACAACSCACAL -- --- COCAGTCACGTTSAT 
ERRERA RARAARRAERRARRR RRRRRRRR 


RRARRARRRARRR ARK 


Phel? 4: CARSC COC TEGCATASCACACT OC CGTTSAGCGCATCCTTCGACTC COTCOASACATASATCTATCCASATOTS GCS 
SCTATACTATCATCAGSTCATACTSC TITAC TAGCSGT OC TCASASCTTGCCGOTGTTATATTCOSTACSCCAC GTTSTGTGOTS 


TICGCTCACCCTITC 
D . 


M0199: CARSC CEC TE GLATASCACS CGAL COATTASCTC COGATTC GGCATGTGTGTATASTCTCAGCAGCTCATTCTCTTS 
TCCACTTSCATCSAAC TTGTCGACGTOT GL TGTATSS CATCS GCS CT CTASSTASTTACACC ST OTGOSCGTCAGCTGC GTCOGCT 


TOTTATCTTCATCITTCATGTGTATIC GCTCACCCTITC 


Fig.4 Sequence alignment of some electromorphs 


from seven different catfish species 


(amplified by the primer pair designed for Cha06) 


See Fig. 1 for labeling and other details. GenBank identifiers from top to bottom: [AY 196567— AY 196575] and [AY 196578—AY 196579]. A: 
Alignment of the sequences from Clarias batrachus (Cba), C. fuscus (Cfu) and C. macrocephalus (Cma); B: Alignment of the sequences 


from C. gariepinus (Cga), and Heterobranchus longifilis (Hlo). Alleles Hlo168-1 and Hlo168-2 were amplified from two different 


individuals of H. longifilis, C: The 174 bp allele from Phractocephalus hemioliopterus (Phe). Underlining indicates the sequences of 


primers designed to match the flanking regions in the source species and used for cross-species amplification; D: The 199 bp allele from 


Heteropneustes fossilis (Hfo). Underlining indicates the sequences of primers designed to match the flanking regions in the source species 


and used for cross-species amplification. 
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ChalZ1 ATTGCGTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCGECTCTC-- TTTTCACCTCACCTGCTA---CTCC 
Cha95 ATIGCETCTCTCTCTCTC-------------------------- GCTCTC--TITTCACCTCACCTGCTA---CTCc 
Cfulll ATTGCGTCTCTCTATCTCTCTICTCTCTCTCTCTCTC TC TC------------ TITTCACCTCACCTGCTA---CTCC 
c£ul09 ATTGCGTCTCTCTATCTCTCTCTCTCTCTCTCTCTCTC-------------- TITTCACCTCACCTGCTA---CTCC 
Cgal17? ATTGCTICTCTCTICTCTCTCICTCTCECTCTCTCECTCTCTGETGTC------ TITTICAGCTCACCTGCTA---GTCC 
Cga119 ATTGCTTICTCTCTCTCTCTCTCTCTCTCECTCTCTCECTCTCTGTGTC----TITTCAGCTCACCTGCTA---GTCC 
Cmal23 ATIGCTGCTCTCTCTCTCTCTICTCTCTCCCTCTTTCTCTCETTC Te TC- --- TTTTCACCTCACCTGCTAGTCCTCC 
Cma125 ATTGCTGCTCTCTCTCTCTCTCTCTCTCTCTCCCTCTTTCTCTCGT TCTC--TTTTCACCTCACCTGCTAGTCCTCC 
Hfo119 ATTGCTGCTCTCTCTCTCTCTCTCCCTCTTTCTCTCGTTCTCTC-------- TITTCACCTCACCTGCTAGTCCTCC 
Hfo12Z25 ATTGCTGCTCTCTCTCTCTCTCTCTCCCTCCCTCTTTCTCTCET TC TCTC--TTTICACC TCACCTGCTAGTCCTCC 
Hiol1ll ATTGCTTCTCTCTCTCTCTCTCTCGCTCTCTCTCAGTCTC------------ TITTCACCTCACCTGCTA---GTCC 
Hlo123 ATTGCTTCTCTCTCTCTCTCTCTCTCTCECECTCTCTCECTCTCTGTETCTCTTTTICAGC TCACCTGCTA---GTCC 


Athhe tthe et +++ tet the atte ee ete +++ 


Fig.5 Sequence alignment of some electromorphs (amplified by the primer pair Cba20) 


from seven different catfish species 
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See Fig. 1 for labeling and other details. GenBank identifiers from top to bottom: [AY 196583-AYAY 59694]. 


fossilis (3 alleles), the (TC), repeat was interrupted by 
CC, TT and GT motifs, whereas in Heterobranchus 
longifilis by GC, AG and TG units. The reason for the 
polymorphism was similar to that described for C. 
gariepinus. 


3 Discussion 


Microsatellites are very useful tools for genetic and 
evolutionary studies. However, their genotyping is based 
on prior sequence information from the genome to be 
analyzed. Despite of recent improvements on the 
procedure (for review see: Zane et al, 2002) the isolation 
of microsatellites is still cumbersome. One of the 
possible solutions for this problem is cross-species 
amplification, which involves the use of primer pairs 
designed for the flanking region of conserved 
microsatellites (of a so-called source species) for 
genotyping in related species amplification (Housley et 
al, 2006; Kayser et al, 1996; Kijas et al, 1995). Data for 
several such experiments have been reported in teleosts 
during the last decade (e.g. Koskinen & Primmer, 1999; 
Yue et al, 2004; Yue et al, 2003). However all PCR 
products generated in the non-source species have only 
been analyzed at the sequence level in a few cases 
(Kayang et al, 2002; Viard et al, 1998). We have tested 
the applicability of four conserved microsatellite markers 
isolated earlier from C. batrachus (Yue et al, 2003) on 
six additional catfish species. We found that PCR primer 
pairs designed for the flanking regions of the four C. 
batrachus microsatellite loci amplified products in most 
of the related species. However, sequencing analyses of 
240 clones representing 50 electromorphs from seven 
catfish species revealed a new problem of cross-species 
amplification of microsatellites: the generation of 
non-orthologous loci, beside the appearance of size 
homoplasy. Primer pairs designed for two C. batrachus 
loci (Cba03 and Cba20) amplified highly similar 
(orthologous) sequence products in all non-source 


species. On the other hand, those designed for other two 
loci (Cba01 and Cba06) yielded polymorphic products 
with entirely different sequence from some of the 
distantly related species (e.g. P hemioliopterus and 
Heteropneustes fossilis), and even in closely related 
species (e.g. C. gariepinus) indicating that these bands 
originated from non-orthologous loci. The amplification 
of specific products from non-orthologous source was 
locus-dependent, and did not reflect the phylogenetic 
relationship. Thus, in the absence of sequence 
information it would be very difficult to predict whether 
certain primer pairs will amplify products from 
orthologous loci in a given non-source species or not. 
Similar phenomenon was observed earlier in soybean 
(Peakall et al, 1998) and rice (Chen et al, 2002), but 
those findings have not been analyzed in detail. Taken 
together, our data suggest that generation of polymorphic 
products from non-orthologous loci by cross-species 
amplification is not a unique feature of certain taxonomic 
groups in fish, instead it might occur throughout the 
animal and plant kingdom. Although the mechanisms 
underlying this phenomenon are not fully understood, 
they are thought to be related to genome and gene 
duplication, as well as speciation. Such events are 
expected be more frequent in fish, since the ancestor of 
today’s teleosts seems to have experienced an additional 
round of genome duplication (Meyer & Schartl, 1999; 
Postlethwait et al, 2000) and chromosome duplications 
(Chang et al, 2005) after their ancestor has split from that 
of the other vertebrates. Duplication of microsatellite loci 
followed by gene conversion can lead to amplification of 
non-orthologous loci as proposed (Angers et al, 2002). 
Sequencing of all alleles of four microsatellite loci 
in the source and six non-source species showed that 
length difference of microsatellites was not restricted to 
their repeat regions. A longer insertion and several 
shorter insertions were detected in the flanking region of 
the loci orthologous to Cba06 in non-source species. At 
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the Cha01, Cba06 and Cba20 loci, a number of alleles 
from different non-source species showed the same 
length, but with different sequences. At the same time, at 
Cba03 
represented the same sequences, suggesting that size 


locus electromorphs of the same length 
homoplasy for microsatellite markers produced by 
cross-species amplification is locus-dependent, it does 
not reflect the phylogenetic relationship. We also found 
the tendency of increase in the number of interrupted 
repeats of orthologous loci in non-source species, as 
observed by others in different taxonomic groups (e.g. 
Culver et al, 2001; Di Gaspero et al, 2000; Estoup et al, 
1995; Garza et al, 1995; van Oppen et al, 2000). This 
tendency also seems to be locus-dependent in catfish, 
since two loci (Cba0] and Cba20) showed clear 
interruptions in non-source species, whereas the other 
two (Cba03 and Cba06) 


interruptions in them. 


exhibited no or few 


Applications of microsatellites to population 
genetics, ecological and evolutionary studies rely heavily 
on the models used for explaining the mutational process 
of these markers. However, all models relay on the 
assumption that differences between alleles at 
orthologous loci are due entirely to changes in the 
number of repeats. In this study, we demonstrated that 
appearance of size homoplasy and amplification of 
non-orthologous products by cross-species amplification 
were locus-dependent, and did not reflect phylogenetic 
relationships. Therefore, application of cross amplific- 


References: 


Agnese, JF, Teugels, GG. 2005. Insight into the phylogeny of African 
Clariidae (Teleostei, Siluriformes): Implications for their body 
shape evolution, biogeography, and taxonomy [J]. Mol Phylogenet 
Evol, 36 (3): 546-553. 

Angers B, Gharbi K, Estoup A. 2002. Evidence of gene conversion 
events between paralogous sequences produced by 
tetraploidization in Salmoninae fish [J]. J Mol Evol, 54 (4): 
501-510. 

Anmarkrud JA, Kleven O, Bachmann L, Lifjeld JT. 2008. 
Microsatellite evolution: Mutations, sequence variation, and 
homoplasy in the hypervariable avian microsatellite locus HrU10 
[J]. BMC Evol Biol, 8: 138. 

Chang CH, Hsieh LC, Chen TY, Chen HD, Luo L, Lee HC. 2005. 
Shannon information in complete genomes [J]. J Bioinform 
Comput Biol, 3 (3): 587-608. 

Chang YM, Kuang YY, Liang LQ, Lu CY, He JG, Su XW. 2008. 
Searching for protein-coding genes using microsatellites in 
common carp by comparing to zebrafish EST database [J]. Zool 
Res, 29 (4): 373-378. 

Chen X, Cho YG, McCouch SR. 2002. Sequence divergence of rice 
microsatellites in Oryza and other plant species [J]. Mol Genet 


ation of microsatellites to population genetics and 
phylogenetic analyses in distantly or even in closely 
related species, might make the interpretation of length 
difference of electromorphs difficult and cause wrong 
estimation of evolutionary relationship. 

In conclusion, we revealed a new problem of 
microsatellite cross-species amplification, namely 
amplification of non-orthologous loci, besides the 
well-known problem (size homoplasy). The new problem 
and appearance of size homoplasy will lead to obvious 
complications for phylogenetic interferences, population 
studies. The 


generated by 


genetics, mapping and evolutionary 


sequence analysis of products 


“cross-species primers” should always be performed, 


as it could reveal previously unrecognized problems and 
might allow for extracting more information from these 
loci, thereby increasing their usefulness. 


Acknowledgments: The authors would like to 
thank Drs. Graham Mair, Arlo Fast, Bela Urbanyi and 
Lian Chuan Lim, as well as Ferenc Radics, Judit 
Raczkevi and Gyula Pasareti for supplying fin clips from 
various catfish species, and the Strategic Research 
Program of TLL for financial support. B.K. is grateful 
for the support of the Temasek Life Sciences Laboratory, 
the Bolyai Research Fellowship of the Hungarian 
Academy of Sciences, and the Hungarian Scientific 
Research Fund (OTKA PD79177).. 


Genomics, 268 (3): 331-343. 

Culver M, Menotti-Raymond MA, O'Brien SJ. 2001. Patterns of size 
homoplasy at 10 microsatellite loci in pumas (Puma concolor) [J]. 
Mol Biol Evol, 18 (6): 1151-1156. 

Di Gaspero G, Peterlunger E, Testolin R, Edwards KJ, Cipriani G. 2000. 
Conservation of microsatellite loci within the genus Vitis [J]. 
Theor Appl Genet, 101 (1-2): 301-308. 

Estoup A, Jarne P, Cornuet JM. 2002. Homoplasy and mutation model 
at microsatellite loci and their consequences for population 
genetics analysis [J]. Mol Ecol, 11 (9): 1591-1604. 

Estoup A, Tailliez C, Cornuet JM, Solignac M. 1995. Size homoplasy 
and mutational processes of interrupted microsatellites in two bee 
species, Apis mellifera and Bombus terrestris (Apidae) [J]. Mol 
Biol Evol, 12 (6): 1074-1084. 

Garza JC, Slatkin M, Freimer NB. 1995. Microsatellite allele 
frequencies in humans and chimpanzees, with implications for 
constraints on allele size [J]. Mol Biol Evol, 12 (4): 594-603. 

Goldstein DB, Schlotterer C. 1999. Microsatellites: Evolution and 
Applications [M]. Oxford: Oxford University Press. 

Gonzalez-Martinez SC, Robledo-Arnuncio JJ, Collada C, Diaz A, 
Williams CG, Alia R, Cervera MT. 2004. Cross-amplification and 


140 Zoological Research 


sequence variation of microsatellite loci in Eurasian hard pines [J]. 
Theor Appl Genet, 109 (1): 103-111. 

Harr B, Zangerl B, Brem G, Schlotterer C. 1998. Conservation of 
locus-specific microsatellite variability across species: A 
comparison of two Drosophila sibling species, D. melanogaster 
and D. simulans [J]. Mol Biol Evol, 15 (2): 176-184. 

Hempel K, Peakall R. 2003. Cross-species amplification from crop 
soybean Glycine max provides informative microsatellite markers 
for the study of inbreeding wild relatives [J]. Genome, 46 (3): 
382-393. 

Housley DJ, Zalewski ZA, Beckett SE, Venta PJ. 2006. Design factors 
that influence PCR amplification success of cross-species primers 
among 1147 mammalian primer pairs [J]. BMC Genomics, 7: 253. 

Küpper C, Burke T, Székely T, Dawson DA. 2008. Enhanced 
cross-species utility of conserved microsatellite markers in 
shorebirds [J]. BMC Genomics, 9: 502. 

Kayang BB, Inoue-Murayama M, Hoshi T, Matsuo K, Takahashi H, 
Minezawa M, Mizutani M, Ito S. 2002. Microsatellite loci in 
Japanese quail and cross-species amplification in chicken and 
guinea fowl [J]. Genet Sel Evol, 34 (2): 233-253. 

Kayser M, Ritter H, Bercovitch F, Mrug M, Roewer L, Nurnberg P. 
1996. Identification of highly polymorphic microsatellites in the 
rhesus macaque Macaca mulatta by cross-species amplification 
[J]. Mol Ecol, 5 (1): 157-159. 

Kijas JM, Fowler JC, Thomas MR. 1995. An evaluation of sequence 
tagged microsatellite site markers for genetic analysis within 
Citrus and related species [J]. Genome, 38 (2): 349-355. 

Koskinen MT, Primmer CR. 1999. Cross-species amplification of 
salmonid microsatellites which reveal polymorphism in European 
and Arctic grayling, Salmonidae: Thymallus spp [J]. Hereditas, 
131 (2): 171-176. 

Kumar S, Tamura K, Jakobsen IB, Nei M. 2001. MEGA2: molecular 
evolutionary genetics analysis software [J]. Bioinformatics, 17: 
1244-1245. 

Lin G Chang A, Yap W, Yue GH. 2008. Characterization and 
cross-species amplification of microsatellites from the endangered 
Hawksbill turtle (Eretmochelys imbricate) [J]. Conserv Genet, 9: 
1071-1073. 

Makova KD, Nekrutenko A, Baker RJ. 2000. Evolution of 


microsatellite alleles in four species of mice (genus Apodemus) [J]. 


J Mol Evol, 51 (2): 166-172. 

Meyer A, Schartl M. 1999. Gene and genome duplications in 
vertebrates: the one-to-four (- to-eight in fish) rule and the 
evolution of novel gene functions [J]. Curr Opin Cell Biol, 11 (6): 
699-704. 

Peakall R, Gilmore S, Keys W, Morgante M, Rafalski A. 1998. 


Vol. 31 


Cross-species amplification of soybean (Glycine max) simple 
sequence repeats (SSRs) within the genus and other legume 
genera: Implications for the transferability of SSRs in plants [J]. 
Mol Biol Evol, 15 (10): 1275-1287. 

Postlethwait JH, Woods IG, Ngo-Hazelett P, Yan YL, Kelly PD, Chu F, 
Huang H, Hill-Force A, Talbot WS. 2000. Zebrafish comparative 
genomics and the origins of vertebrate chromosomes [J]. Genome 
Res, 10 (12): 1890-1902. 

Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG. 
1997. The CLUSTAL_X windows interface: flexible strategies for 
multiple sequence alignment aided by quality analysis tools [J]. 
Nucleic Acids Res, 25 (24): 4876-4882. 

van Oppen MJH, Rico C, Turner GF, Hewitt GM. 2000. Extensive 
homoplasy, nonstepwise mutations, and shared ancestral 
polymorphism at a complex microsatellite locus in Lake Malawi 
cichlids [J]. Mol Biol Evol, 17 (4): 489-498. 

Viard F, Franck P, Dubois MP, Estoup A, Jarne P. 1998. Variation of 
microsatellite size homoplasy across electromorphs, loci, and 
populations in three invertebrate species [J]. J Mol Evol, 47 (1): 
42-51. 

Wang HZ, Yin QQ, Feng ZG, Li, DY, Sun XW, Li C. 2008. 
Construction of fractional genomic libraries and screening 
microsatellites DNA of Esox reieherti Dybowski [J]. Zool Res, 29 
(3): 245-252. 

Weber JL, May PE. 1989. Abundant class of human DNA 
polymorphisms which can be typed using the polymerase 
chain-reaction [J]. Am J Hum Genet, 44 (3): 388-396. 

Yue GH, Chen F, Orban L. 2000. Rapid isolation and characterization 
of microsatellites from the genome of Asian arowana (Scleropages 
formosus, Osteoglossidae, Pisces) [J]. Mol Ecol, 9 (7): 1007-1009. 

Yue GH, David L, Orban L. 2007. Mutation rate and pattern of 
microsatellites in common carp (Cyprinus carpio L.) [J]. Genetica, 
129 (3): 329-31. 

Yue GH, Ho MY, Orban L, Komen J. 2004. Microsatellites within 
genes and ESTs of common carp and their applicability in silver 
crucian carp [J]. Aquaculture, 234 (1-4): 85-98. 

Yue GH, Kovacs B, Orban L. 2003. Microsatellites from Clarias 
batrachus and their polymorphism in seven additional catfish 
species [J]. Mol Ecol Notes, 3 (3): 465-468. 

Yue GH, Orban L. 2001. Rapid isolation of DNA from fresh and 
preserved fish scales for polymerase chain reaction [J]. Mar 
Biotechnol, 3 (3): 199-204. 

Zane L, Bargelloni L, Patarnello T. 2002. Strategies for microsatellite 
isolation: a review [J]. Mol Ecol, 11 (1): 1-16. 

Zhang YW, Zhang YP, Aryder O. 2001. Microsatellites and its 
application [J]. Zool Res, 22 (4): 315-320. 


